Plant GARDEN How To ▶︎

JA

EN

Nicotiana tabacum

CDS Sequence

CDS ID

lcl|NW_015915740.1_cds_XP_016465503.1_58451

CDS Infomation

[gene=LOC107788337] [db_xref=GeneID:107788337] [protein=THO complex subunit 2 isoform X2] [protein_id=XP_016465503.1] [location=join(7403..7537,7915..8040,10621..10662,16375..16434,16539..16601,17001..17081,18611..18682,18757..18804,18931..18999,19097..19230,20176..20251,20364..20557,26488..26568,34203..34407,34504..34746,34820..34900,34985..35104,35202..35261,52237..52364,55798..55894,55984..56112,58914..59017,59375..59555,60239..60333,60603..61209,61341..61445,61960..62115,62204..62284,62710..63108,64013..65176,66610..66908,67044..67113)] [gbkey=CDS]

Sequence

ATGTCAGTTTTAGGTTTGGAGTTTTTGTACGTTACAGAGGAGTGCATCAAAGAGTTGAAA
AATGGCAACAGCAGCTTCAAGTTCTCTGAACCGCTTCCCACTCTGCGCTTTCTCTATGAG
CTCTGCTCCGTTATGGTTTGTGGTGAATTGCCCATTCAGAAATGTAAAGTGGCATTGGAG
TCTGTGGAGTTTGTGGATTATGCTTCCCAGGAGGAGCTAGGGTCGAGTTTGGCTGATATT
GTTAGCCAAATGGCTCAGGATCTTTCGATGCCGGGAGAAAATCGTCAGCGTCTTATCAAG
CTGGCAAAATGGCTCGTGGAATCTGCTTTGGTTCCTTTGAGATTTCTTCTGGAGCGATGT
GAGGAAGAGTTTTTGTGGGAATCTGAGATGATCAAGATAAAGGCTGCAGATTTGAAGTCG
AAAGAGGTTAGGGTAAACACTCGTCTTCTTTATCAGCAAACAAAGTTTAACCTTCTCCGG
GAAGAGAGTGAAGGCTACGCCAAGCTGGTCACGCTCCTTTCTCAAATGCCAGAGGGTTCT
ATGCAAAATGCTTCAACTGCTACGGTTGGCATAATCAAGTCATTGATTGGGCACTTCGAT
CTGGATCCAAACCGAGTCTTTGATATTGTTTTGGAGTGTTTTGAACATCAGCCTGGTAAT
ACCACATTTTTGGACTTGATTCCCATATTTCCCAAGTCCCATGCCTCCCAGATTCTGGGG
TTTAAGTTTCAATACTACCAACGACTGGAAGTCAATGATCCCGTTCCTAGTGGTCTTTAT
CAGCTAACAGCCTTGCTGGTGAAAAGAGACTTCATTGATGTTGACAGCATTTATGCACAT
TTGCTTCCCAAGGAGGAGGATGCTTTCGACCATTACAATGCATTTTCAGCTAAGAGACTT
GATGAGGCTAACAGAATAGGTAGAATAAATCTTGCTGCTACTGGGAAGGATCTCATGGAT
GAAGAGAAACAAGGAGATGTGACAGTGGATCTGTATGCTGCATTGGACATGGAGACAGAG
GCAGTTGCCGAGCGTTCCGCAGAGCTAGAAAATAGTCAACCCGTGGGCTTGCTTATGGGA
TTTCTTGAAGTGGATGACTGGTATCATGCTCATGTGTTGTTTGACCGTCTCTCACATCTT
AATCCAGCAGAGCATATACAAACATGCAATGGATTATTCAGGCTCATTGAAAGATCAATA
TCTGAACCGTATGACCTTGTTCGCAAGATGCAACTTTTGGGTTTACTTCCTGGAGTCGTC
ACTGACTCTATGGAAGTGGCAAATTCGTCAAGCAGTAGATCTTTTATCAATCTTCCAAAA
GAGCTTTTTGAGATGCTTTCTTCTGTTGGACCTCATCTTTATCGAGATACATTGTTGCTA
CAGAAGGTATGCAGAGTGTTAAGAGGTTATTACATTTGTGCACATCAGCTTGTCGCTAGT
GGTGTGGCAGGTTTTATCTCCCAAACTGTTACAATTGGAGATCAAATTCCTCGTATACAC
CTGAAGGATGCTAGGTCAAGAATTGAGGAAGCATTAGGAGGATGCTTGCTTCCTTCCTTA
CAGTTGATACCAGCAAATCCTGCAGTTGGACTAGAGATCTGGGAACTAATGAATCTCCTT
CCGTATGAGGCACGTTATCGTTTATATGGTGAATGGGAGAAAGATGATGAGCAATTTCCA
ATGCTTTTGGCGGCAAGGCAAACAGCAAAGTTGGACACTAGGAGGATCCTGAAGCGCCTT
GCAAAAGAAAATCTAAAGCAGCTCGGTCGAATGGTTGCCAAACTTGCTCATGCTAATCCA
ATGACAGTATTGCGAACAATCGTTCACCAGATTGAGGCTTACAGGGATATGATTACACCT
GTTGTAGACGCTTTTAAGTATCTGACTCAGCTGGAGTATGATATTTTGGAATATGTTGTT
ATCGAACGGTTGGCACAAAGTGGGAGAGAGAAGCTGAAAGATGATGGTCTTAATTTGTGC
GATTGGCTTCAGTCTTTGGCATCATTTTGGGGTCACCTGTGTAAAAAGTACCCGTCAATG
GAATTGAGGGGCCTTTTTCAGTATCTTGTTAATCAGTTGAAGAGAGGAAACGGTATTGAG
CTTGTGTTCATGCAGGAGCTCATTCAGCAAATGGCTAATGTACACTATACAGAGAACATG
ACAGAGGAACAATTGGATGCCATGGCTGGAAGTGATACTCTACGCTATCAGGCCACCTCA
TTCGGAATAACCCGGAACAATAAGGCATTGATTAAATCGACTAACAGGCTGAGGGATTCT
TTGCTCCCAAAAGATGAACCAATGTTGGCAATTCCACTGTTGCTACTCATTGCTCAACAT
CGTTCTGTGGTAGTTATTAATGCGGAAGCTCCATACATTAAAATGGTCAGTGAACAGTTT
GATAGGTGTCATGGAGCCCTTCTTCAGTATGTTGAATTTTTGTCTAGTGCGGTGACTCCA
ACAGCTGCCTATGCTCTTCTCGTTCCAGCTCTTGATGAGCTTGTACATGTGTATCATCTT
GATCCTGAGGTAGCATTTTTGATTTATCGGCCTGTTATGAGGCTCTTCAAGTGTCAGAGA
AATTCAGATGTCTTTTGGCCTTCAGATAGTGATGAAGCAGTGAGATGGACAGATCTTCTT
GATACCATCAAAACAATGTTGCCTTCAAAAGCCTGGAATAGCTTGTCCCCAGACTTGTAT
GCCACCTTCTGGGGTCTTACACTCTATGATCTTCATGTTCCTAGATCCCGTTACGAGTCT
GAAATTGCTAAGCAGCATGCTGCTCTTAAAGCTCTGGAAGAACTTTCTGATAATTCAAGT
TCTGCAATCACAAAAAGGAAAAAAGATAAAGAAAGAATTCAAGAGTCACTAGATCGGTTA
AGTATGGAGCTTCAGAGGCATGAAGAACATGTTACATCTGTTCGCAGACGACTGACTCGT
GAAAAGGATACATGGCTGAGTTCGTGTCCTGATACTTTAAAGATCAACATGGAGTTCCTT
CAGCGGTGTATATTTCCACGCTGTACGTTCAGTATGCCAGACGCTGTGTATTGCGCCATG
TTTGTTAATACTCTTCATTCCCTTGGAACACCCTTCTTTAACACTGTGAACCACATAGAC
GTTTTGATATGTAAGACAATACAACCCATGATCTGTTGTTGCACCGAATATGAAGTAGGT
CGACTAGGAAGATTTTTGTATGAGACATTGAAGACTGCTTATTATTGGAAGGGTGATGAA
TCAATTTATGAACGTGAATGTGGAAATATGCCTGGATTTGCCGTCTATTACAGATATCCA
AACAGCCAGCGTGTTACATATGGCCAATTTATTAAGGTGCACTGGAAGTGGAGCCAAAGG
ATCACGAGGTTGCTCATACAGTGTCTGGAATCAACTGAGTACATGGAGATCAGAAATGCT
CTTATTTTATTGACAAAGATCTCGAATGTCTTTCCAGTTACTCGGAAGAGTGGAATAAAC
CTTGAGAAGAGGGTTGCCAAAATTAAATCTGATGAAAGAGAGGATCTCAAAGTATTGGCT
ACAGGGGTTGCTGCAGCTCTGGCTTCTAGAAAGCCATCATGGGTGACAGATGAAGAGTTT
GGTATGGGTTACCTAGAACTAAAACCTGCGGCAACCCCCGCTTCTAAATCTTCGACTGTT
AATTCAGTATCCATACCGAATGGGAGTGGCCCTAGTGTTTCTCAAGTTGAGCCTTCTGTT
GGAAGAAGTGTGGCAGCAGGAAGAGTAGTTGATGGCAAGTTGGATAGGCTAGAGAGTTCT
ATGCCAAAACCTGACTTAGGTCAGGTAAAACTGAAATGTAGTCAATCAGTTAATGGACTG
GATTTGCTATCTATGCCATCTGCTGCCCTGCACTCTGGTACTCCAAGTCAAAGACATGTA
GATGAGTTTACGAGTAGACCATTGGAAGAGAATACCATAAAAGCTGCTTCTAAGATGTAT
GGCGAGCAGGAGGGAAGAGCTACACGCAAGCGAGCTGCTCCTGCTGGATCTCTTTCAAAG
CAACAAAAGCATGATATTGAAAAAGACGACAAGTCTGGGAAAGCTGTTGGAAGAGCAACT
GGAGCTACTTATGTTGATGTTGGTCATCCTTCTGAGAAAAGAGCAAGTGGGAATGTCAAT
GTTTTTGCTACAGTTTCAGGGAATGGTAGCTTGTTGTCTGCTGTAGCTAAAAGTGCAGCT
TCATTAATGAGATCACCAGATCTTTCAAGTGAATCGAAAGCAGAACTTGCAGCTACCAAA
TCAGCTGAGCTGAGGTTTTCTGCTGGAAAAGATGATGGCAATGAAAGCTCTGATGTGCAT
AAGCAGTCCTCATCGCGTTTGGTCCATTCGCCTCGGCAGGATGCTTCTAGAGCTAATGAA
AAAGTACAGAAGAGATCTAGCCCCACGGAAGATCTTGATAGACTGAATAAACGCCGGAAA
GGTGAACTTGACAGTAGAGATATTGATGGTGGTGATGTTCGTTCATCTGAGAGAGAGCGG
TTAATAGATGCAAGAGCTGCTGATAAACTTCATGCAGCAGATTATGATAAACATGGATCG
GATGATCAAATATTGAACCGGGCCTCTGAAAAGCCTCTTGATAGATCCAAAGATAAGGGT
GGTGAAAGACATGAGAAAGACCACAAAGAAAGAGTGGACCGTCCTGACAAGTCTCGTGGG
GATGATACGTTGTCTGAAAAATCAAGAGATAGGTCGACAGAGCGGCATGGAAGAGAACGT
TCTGTTGAAAGAGTACTGGAGAGAGGTGCTGATAGGAACTTCGATAGGTTAAGTAAGGAT
GAAAGAATCAAAGATGATAGGAGCAAGCCGCGACATAGTGAAGCATCTGTAGAGAAATCT
CCTACAGATGATCGGTTTCACAATCAAAATTTGCCTCCACCTCCACCACTTCCGCCTCAC
CTGGTCCCCCAATCCATTAATGTAGGCAGAAGAGATGACGATTCTGATAGACGTTTTGGA
ACAGCCAGGCATAGTCAAAGACTTTCTCCAAGGCATGATGAGAGAGAGAGACGGCGATCA
GAAGAGAATAATGCATTATTGCAGGAAGATTTGAAGCGAAGGAGAGAAGAAGATTTTCGA
GATAGAAAGCGGGAAGAAAGAGAACTTCCAATGAAGGTAGAGGAGAGGGAAAGAGAGAGG
GAGAAAGCAAGCCTCGTGAAAGAGGATTTGGATCCAAATGCCTCAAAGAGGCGCAAACTT
AAGAGAGAGCATATGGCTTCAGAACCTGGCGAGTATTCACCCGCTACTCATCCTCCTGCC
CTTTCTATTAATATGTCACAGCCCTATGATGGAAGAGATAGGGGAGAGCGGAAGGGTGTC
ATTGTTCAGCAGCGTCCAGGTTACTTGGACGAGCCAGGTCTTAGGCTTCACGGAAAAGAA
AGTGCCAGCAAAGCACCTCGTCGTGATCTTGACCCTATGTATGACAGAGAGTGGGATGAG
GACAAGAGGCAAAGAGCTGAGCCCAAGAGGCGGCATCGCAAGTAA

Peptide Sequence

PEP ID

XP_016465503.1

PEP Infomation

PREDICTED: THO complex subunit 2 isoform X2 [Nicotiana tabacum]

Sequence

MSVLGLEFLYVTEECIKELKNGNSSFKFSEPLPTLRFLYELCSVMVCGELPIQKCKVALE
SVEFVDYASQEELGSSLADIVSQMAQDLSMPGENRQRLIKLAKWLVESALVPLRFLLERC
EEEFLWESEMIKIKAADLKSKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLSQMPEGS
MQNASTATVGIIKSLIGHFDLDPNRVFDIVLECFEHQPGNTTFLDLIPIFPKSHASQILG
FKFQYYQRLEVNDPVPSGLYQLTALLVKRDFIDVDSIYAHLLPKEEDAFDHYNAFSAKRL
DEANRIGRINLAATGKDLMDEEKQGDVTVDLYAALDMETEAVAERSAELENSQPVGLLMG
FLEVDDWYHAHVLFDRLSHLNPAEHIQTCNGLFRLIERSISEPYDLVRKMQLLGLLPGVV
TDSMEVANSSSSRSFINLPKELFEMLSSVGPHLYRDTLLLQKVCRVLRGYYICAHQLVAS
GVAGFISQTVTIGDQIPRIHLKDARSRIEEALGGCLLPSLQLIPANPAVGLEIWELMNLL
PYEARYRLYGEWEKDDEQFPMLLAARQTAKLDTRRILKRLAKENLKQLGRMVAKLAHANP
MTVLRTIVHQIEAYRDMITPVVDAFKYLTQLEYDILEYVVIERLAQSGREKLKDDGLNLC
DWLQSLASFWGHLCKKYPSMELRGLFQYLVNQLKRGNGIELVFMQELIQQMANVHYTENM
TEEQLDAMAGSDTLRYQATSFGITRNNKALIKSTNRLRDSLLPKDEPMLAIPLLLLIAQH
RSVVVINAEAPYIKMVSEQFDRCHGALLQYVEFLSSAVTPTAAYALLVPALDELVHVYHL
DPEVAFLIYRPVMRLFKCQRNSDVFWPSDSDEAVRWTDLLDTIKTMLPSKAWNSLSPDLY
ATFWGLTLYDLHVPRSRYESEIAKQHAALKALEELSDNSSSAITKRKKDKERIQESLDRL
SMELQRHEEHVTSVRRRLTREKDTWLSSCPDTLKINMEFLQRCIFPRCTFSMPDAVYCAM
FVNTLHSLGTPFFNTVNHIDVLICKTIQPMICCCTEYEVGRLGRFLYETLKTAYYWKGDE
SIYERECGNMPGFAVYYRYPNSQRVTYGQFIKVHWKWSQRITRLLIQCLESTEYMEIRNA
LILLTKISNVFPVTRKSGINLEKRVAKIKSDEREDLKVLATGVAAALASRKPSWVTDEEF
GMGYLELKPAATPASKSSTVNSVSIPNGSGPSVSQVEPSVGRSVAAGRVVDGKLDRLESS
MPKPDLGQVKLKCSQSVNGLDLLSMPSAALHSGTPSQRHVDEFTSRPLEENTIKAASKMY
GEQEGRATRKRAAPAGSLSKQQKHDIEKDDKSGKAVGRATGATYVDVGHPSEKRASGNVN
VFATVSGNGSLLSAVAKSAASLMRSPDLSSESKAELAATKSAELRFSAGKDDGNESSDVH
KQSSSRLVHSPRQDASRANEKVQKRSSPTEDLDRLNKRRKGELDSRDIDGGDVRSSERER
LIDARAADKLHAADYDKHGSDDQILNRASEKPLDRSKDKGGERHEKDHKERVDRPDKSRG
DDTLSEKSRDRSTERHGRERSVERVLERGADRNFDRLSKDERIKDDRSKPRHSEASVEKS
PTDDRFHNQNLPPPPPLPPHLVPQSINVGRRDDDSDRRFGTARHSQRLSPRHDERERRRS
EENNALLQEDLKRRREEDFRDRKREERELPMKVEEREREREKASLVKEDLDPNASKRRKL
KREHMASEPGEYSPATHPPALSINMSQPYDGRDRGERKGVIVQQRPGYLDEPGLRLHGKE
SASKAPRRDLDPMYDREWDEDKRQRAEPKRRHRK

Transcript Sequence

Gene sequence data has not been registered.